Collins-LA: Collins’ Head-Driven Model with Latent Annotation
نویسندگان
چکیده
Recent works on parsing have reported that the lexicalization does not have a serious role for parsing accuracy. Latent-annotation methods such as PCFG-LA are one of the most promising un-lexicalized approaches, and reached the-state-of-art performance. However, most works on latent annotation have investigated only PCFG formalism, without considering the Collins’ popular head-driven model, though it is a significantly important and interesting issue. To this end, this paper develops Collins-LA, the extension of the Collins’ head-driven model to support the latent annotation. We report its basic accuracy, comparing with PCFG-LA. The experimental results show that Collins-LA has potential to improve basic parsing accuracy, resulting in comparable performance with PCFG-LA even in the naive setting.
منابع مشابه
Head-Driven Parsing for Word Lattices
We present the first application of the head-driven statistical parsing model of Collins (1999) as a simultaneous language model and parser for largevocabulary speech recognition. The model is adapted to an online left to right chart-parser for word lattices, integrating acoustic, n-gram, and parser probabilities. The parser uses structural and lexical dependencies not considered by ngram model...
متن کاملCross-Lingual Syntactic Transfer with Limited Resources
We describe a simple but effective method for cross-lingual syntactic transfer of dependency parsers, in the scenario where a large amount of translation data is not available. The method makes use of three steps: 1) a method for deriving cross-lingual word clusters, that can then be used in a multilingual parser; 2) a method for transferring lexical information from a target language to source...
متن کاملTreacher Collins Syndrome
Treacher Collins syndrome (TCS) is a genetic disease that alters the development of bones and other tissues in the face, and presents variable expressivity. At least three genes TCOF1, POLR1D, and POLR1C were recognized to be at the origin of this syndrome which may be inherited through either an autosomal dominant or autosomal recessive pattern. TCS changes can be divided into otological, opht...
متن کاملHead-Driven Statistical Models for Natural Language Parsing
HEAD DRIVEN STATISTICAL MODELS FOR NATURAL LANGUAGE PARSING Michael Collins Supervisor Professor Mitch Marcus Statistical models for parsing natural language have recently shown considerable suc cess in broad coverage domains Ambiguity often leads to an input sentence having many possible parse trees statistical approaches assign a probability to each tree thereby rank ing competing trees in or...
متن کاملIntricacies of Collins' Parsing Model
This article documents a large set of heretofore unpublished details Collins used in his parser, such that, along with Collins’ (1999) thesis, this article contains all information necessary to duplicate Collins’ benchmark results. Indeed, these as-yet-unpublished details account for an 11% relative increase in error from an implementation including all details to a clean-room implementation of...
متن کامل